Overview

Dataset info

Number of variables: 21
Number of observations: 21613
Missing cells: 0 (0.0%)
Duplicate rows: 0 (0.0%)
Total size in memory: 3.5 MiB
Average record size in memory: 168.0 B
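The memory figures above can be cross-checked with simple arithmetic. The sketch below assumes each stored value takes 8 bytes (for example a 64-bit number or an object pointer); that width is an assumption, not something the report states, and the constant name is illustrative.

```python
# Figures copied from the dataset overview.
n_rows = 21613
n_columns = 21

# Assumption: every value occupies 8 bytes (64-bit number or object pointer).
BYTES_PER_VALUE = 8

record_bytes = n_columns * BYTES_PER_VALUE  # bytes needed for one row
total_bytes = n_rows * record_bytes         # bytes needed for the whole table
column_bytes = n_rows * BYTES_PER_VALUE     # bytes needed for a single column

print(f"record size: {record_bytes} B")
print(f"total size:  {total_bytes / 2**20:.1f} MiB")
print(f"column size: {column_bytes / 2**10:.1f} KiB")
```

Under that assumption the record size matches the reported 168.0 B and the total rounds to the reported 3.5 MiB. The per-column figure comes out slightly below the 169.0 KiB "Memory size" listed for each variable, which would be consistent with a small index overhead.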

Variable types

Numeric: 19
Categorical: 1
Boolean: 1
Date: 0
URL: 0
Text (Unique): 0
Rejected: 0
Unsupported: 0

Warnings

[Warning] date has a high cardinality: 372 distinct values
[Zeros] sqft_basement has 13126 (60.7%) zeros
[Zeros] view has 19489 (90.2%) zeros
[Zeros] yr_renovated has 20699 (95.8%) zeros

Variables

bathrooms
Numeric

Distinct count: 30
Unique (%): 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 2.114757322
Minimum: 0
Maximum: 8
Zeros (%): < 0.1%

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 1.75
Median: 2.25
Q3: 2.5
95-th percentile: 3.5
Maximum: 8
Range: 8
Interquartile range: 0.75

Descriptive statistics

Standard deviation: 0.7701631572
Coef of variation: 0.3641851238
Kurtosis: 1.279902444
Mean: 2.114757322
MAD: 0.6153609574
Skewness: 0.5111075733
Sum: 45706.25
Variance: 0.5931512887
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=30)
Histogram with variable size bins (bins=[0. 0.625 0.875 1.125 1.375 ... 4.125 4.625 5.375 6.125 8. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
2.5 5380 24.9%
 
1 3852 17.8%
 
1.75 3048 14.1%
 
2.25 2047 9.5%
 
2 1930 8.9%
 
1.5 1446 6.7%
 
2.75 1185 5.5%
 
3 753 3.5%
 
3.5 731 3.4%
 
3.25 589 2.7%
 
Other values (20) 652 3.0%
 

Minimum 5 values

Value Count Frequency (%)
0 10 < 0.1%
 
0.5 4 < 0.1%
 
0.75 72 0.3%
 
1 3852 17.8%
 
1.25 9 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
8 2 < 0.1%
 
7.75 1 < 0.1%
 
7.5 1 < 0.1%
 
6.75 2 < 0.1%
 
6.5 2 < 0.1%
 

bedrooms
Numeric

Distinct count: 13
Unique (%): 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 3.370841623
Minimum: 0
Maximum: 33
Zeros (%): 0.1%

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 3
Median: 3
Q3: 4
95-th percentile: 5
Maximum: 33
Range: 33
Interquartile range: 1

Descriptive statistics

Standard deviation: 0.9300618311
Coef of variation: 0.2759138325
Kurtosis: 49.06365318
Mean: 3.370841623
MAD: 0.7349548336
Skewness: 1.974299535
Sum: 72854
Variance: 0.8650150098
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=13)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 5.5 6.5 7.5 10.5 33. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
3 9824 45.5%
 
4 6882 31.8%
 
2 2760 12.8%
 
5 1601 7.4%
 
6 272 1.3%
 
1 199 0.9%
 
7 38 0.2%
 
8 13 0.1%
 
0 13 0.1%
 
9 6 < 0.1%
 
Other values (3) 5 < 0.1%
 

Minimum 5 values

Value Count Frequency (%)
0 13 0.1%
 
1 199 0.9%
 
2 2760 12.8%
 
3 9824 45.5%
 
4 6882 31.8%
 

Maximum 5 values

Value Count Frequency (%)
33 1 < 0.1%
 
11 1 < 0.1%
 
10 3 < 0.1%
 
9 6 < 0.1%
 
8 13 0.1%
 

condition
Numeric

Distinct count: 5
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 3.40942951
Minimum: 1
Maximum: 5
Zeros (%): 0.0%

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 3
Median: 3
Q3: 4
95-th percentile: 5
Maximum: 5
Range: 4
Interquartile range: 1

Descriptive statistics

Standard deviation: 0.6507430464
Coef of variation: 0.1908656696
Kurtosis: 0.5257635653
Mean: 3.40942951
MAD: 0.5607190317
Skewness: 1.032804637
Sum: 73688
Variance: 0.4234665124
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=5)

Value Count Frequency (%)
3 14031 64.9%
 
4 5679 26.3%
 
5 1701 7.9%
 
2 172 0.8%
 
1 30 0.1%
 

Minimum 5 values

Value Count Frequency (%)
1 30 0.1%
 
2 172 0.8%
 
3 14031 64.9%
 
4 5679 26.3%
 
5 1701 7.9%
 

Maximum 5 values

Value Count Frequency (%)
5 1701 7.9%
 
4 5679 26.3%
 
3 14031 64.9%
 
2 172 0.8%
 
1 30 0.1%
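
Because condition takes only five values, its descriptive statistics can be recomputed from the frequency table alone. The sketch below does that in plain Python. It assumes the variance divides by n - 1 and that MAD means the mean absolute deviation around the mean. The report states neither convention, but the recomputed numbers agree with the reported ones under these assumptions.

```python
from math import sqrt

# Value -> count, copied from the condition frequency table.
condition_counts = {1: 30, 2: 172, 3: 14031, 4: 5679, 5: 1701}

n = sum(condition_counts.values())
total = sum(value * count for value, count in condition_counts.items())
mean = total / n

# Assumption: sample variance (divide by n - 1), not population variance.
variance = sum(
    count * (value - mean) ** 2 for value, count in condition_counts.items()
) / (n - 1)
std = sqrt(variance)

# Assumption: MAD is the mean absolute deviation around the mean.
mad = sum(count * abs(value - mean) for value, count in condition_counts.items()) / n

# Coefficient of variation is the standard deviation relative to the mean.
coef_of_variation = std / mean

print(n, total, mean, variance, std, mad, coef_of_variation)
```

The same approach works for any variable whose frequency table is listed in full, such as floors or view.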
 

date
Categorical

Distinct count: 372
Unique (%): 1.7%
Missing (%): 0.0%
Missing (n): 0

Most frequent values: 20140623T000000 (142), 20140625T000000 (131), 20140626T000000 (131); other values (369): 21209

Value Count Frequency (%)
20140623T000000 142 0.7%
 
20140625T000000 131 0.6%
 
20140626T000000 131 0.6%
 
20140708T000000 127 0.6%
 
20150427T000000 126 0.6%
 
20150325T000000 123 0.6%
 
20150422T000000 121 0.6%
 
20140709T000000 121 0.6%
 
20150414T000000 121 0.6%
 
20150428T000000 121 0.6%
 
Other values (362) 20349 94.2%
 
Max length: 15
Mean length: 15
Min length: 15
Contains chars: True
Contains digits: True
Contains spaces: False
Contains non-words: False
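
The date column is profiled as Categorical because its values are fixed-width strings of 15 characters. The values shown follow a year-month-day, "T", hour-minute-second layout, so they can be parsed into real timestamps. A minimal sketch with the standard library follows; the format string is inferred from the listed values, and the function name is illustrative.

```python
from datetime import datetime

# Assumption: every value follows the layout seen in the table, e.g. 20140623T000000.
DATE_FORMAT = "%Y%m%dT%H%M%S"


def parse_report_date(raw: str) -> datetime:
    """Parse one raw date string from the report into a datetime."""
    return datetime.strptime(raw, DATE_FORMAT)


# The three most frequent values from the frequency table.
top_dates = ["20140623T000000", "20140625T000000", "20140626T000000"]
parsed = [parse_report_date(raw) for raw in top_dates]

for raw, value in zip(top_dates, parsed):
    print(raw, "->", value.date().isoformat())
```

Every value listed in the table has a time part of 000000, so keeping only the calendar date loses nothing for those entries.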

floors
Numeric

Distinct count: 6
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 1.494308981
Minimum: 1
Maximum: 3.5
Zeros (%): 0.0%

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1.5
Q3: 2
95-th percentile: 2
Maximum: 3.5
Range: 2.5
Interquartile range: 1

Descriptive statistics

Standard deviation: 0.5399888951
Coef of variation: 0.361363615
Kurtosis: -0.4847229368
Mean: 1.494308981
MAD: 0.4885226404
Skewness: 0.6161767212
Sum: 32296.5
Variance: 0.2915880069
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=6)

Value Count Frequency (%)
1 10680 49.4%
 
2 8241 38.1%
 
1.5 1910 8.8%
 
3 613 2.8%
 
2.5 161 0.7%
 
3.5 8 < 0.1%
 

Minimum 5 values

Value Count Frequency (%)
1 10680 49.4%
 
1.5 1910 8.8%
 
2 8241 38.1%
 
2.5 161 0.7%
 
3 613 2.8%
 

Maximum 5 values

Value Count Frequency (%)
3.5 8 < 0.1%
 
3 613 2.8%
 
2.5 161 0.7%
 
2 8241 38.1%
 
1.5 1910 8.8%
 

grade
Numeric

Distinct count: 12
Unique (%): 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 7.656873178
Minimum: 1
Maximum: 13
Zeros (%): 0.0%

Quantile statistics

Minimum: 1
5-th percentile: 6
Q1: 7
Median: 7
Q3: 8
95-th percentile: 10
Maximum: 13
Range: 12
Interquartile range: 1

Descriptive statistics

Standard deviation: 1.175458757
Coef of variation: 0.1535168116
Kurtosis: 1.190932077
Mean: 7.656873178
MAD: 0.929600303
Skewness: 0.7711032008
Sum: 165488
Variance: 1.381703289
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=12)
Histogram with variable size bins (bins=[ 1. 3.5 4.5 5.5 6.5 ... 9.5 10.5 11.5 12.5 13. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
7 8981 41.6%
 
8 6068 28.1%
 
9 2615 12.1%
 
6 2038 9.4%
 
10 1134 5.2%
 
11 399 1.8%
 
5 242 1.1%
 
12 90 0.4%
 
4 29 0.1%
 
13 13 0.1%
 
Other values (2) 4 < 0.1%
 

Minimum 5 values

Value Count Frequency (%)
1 1 < 0.1%
 
3 3 < 0.1%
 
4 29 0.1%
 
5 242 1.1%
 
6 2038 9.4%
 

Maximum 5 values

Value Count Frequency (%)
13 13 0.1%
 
12 90 0.4%
 
11 399 1.8%
 
10 1134 5.2%
 
9 2615 12.1%
 

id
Numeric

Distinct count: 21436
Unique (%): 99.2%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 4580301521
Minimum: 1000102
Maximum: 9900000190
Zeros (%): 0.0%

Quantile statistics

Minimum: 1000102
5-th percentile: 512480335
Q1: 2123049194
Median: 3904930410
Q3: 7308900445
95-th percentile: 9297300429
Maximum: 9900000190
Range: 9899000088
Interquartile range: 5185851251

Descriptive statistics

Standard deviation: 2876565571
Coef of variation: 0.6280297396
Kurtosis: -1.260541871
Mean: 4580301521
MAD: 2543592458
Skewness: 0.2433285476
Sum: 9.899405677e+13
Variance: 8.274629486e+18
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[1.00010200e+06 7.60006100e+06 7.60013050e+06 1.15005650e+07 1.15205050e+07 ... 9.83930020e+09 9.83930111e+09 9.84230007e+09 9.84230051e+09 9.90000019e+09], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
795000620 3 < 0.1%
 
2206700215 2 < 0.1%
 
643300040 2 < 0.1%
 
3333002450 2 < 0.1%
 
1995200200 2 < 0.1%
 
1781500435 2 < 0.1%
 
3904100089 2 < 0.1%
 
3323059027 2 < 0.1%
 
6300000226 2 < 0.1%
 
9809000020 2 < 0.1%
 
Other values (21426) 21592 99.9%
 

Minimum 5 values

Value Count Frequency (%)
1000102 2 < 0.1%
 
1200019 1 < 0.1%
 
1200021 1 < 0.1%
 
2800031 1 < 0.1%
 
3600057 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
9900000190 1 < 0.1%
 
9895000040 1 < 0.1%
 
9842300540 1 < 0.1%
 
9842300485 1 < 0.1%
 
9842300095 1 < 0.1%
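
The id column has 21436 distinct values across 21613 observations, yet the overview reports no duplicate rows. Some ids therefore recur on rows that differ in at least one other column. The report does not say why. The sketch below derives the number of surplus records and shows one way to list repeated ids, using a small illustrative sample whose counts are taken from the frequency tables above.

```python
from collections import Counter

# Figures copied from the report.
n_observations = 21613
n_distinct_ids = 21436

# Records beyond the first occurrence of each id.
surplus_records = n_observations - n_distinct_ids

# Illustrative sample only: three ids with the counts shown in the tables above.
sample_ids = [795000620] * 3 + [2206700215] * 2 + [1200019]

# Keep only the ids that occur more than once.
repeated = {id_: count for id_, count in Counter(sample_ids).items() if count > 1}

print(surplus_records)
print(repeated)
```

On the full dataset the same Counter pattern would list every recurring id, which is a reasonable first step before deciding whether to treat id as a key.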
 

lat
Numeric

Distinct count: 5034
Unique (%): 23.3%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 47.56005252
Minimum: 47.1559
Maximum: 47.7776
Zeros (%): 0.0%

Quantile statistics

Minimum: 47.1559
5-th percentile: 47.3103
Q1: 47.471
Median: 47.5718
Q3: 47.678
95-th percentile: 47.74964
Maximum: 47.7776
Range: 0.6217
Interquartile range: 0.207

Descriptive statistics

Standard deviation: 0.1385637102
Coef of variation: 0.002913447377
Kurtosis: -0.6763130016
Mean: 47.56005252
MAD: 0.1148297137
Skewness: -0.4852704765
Sum: 1027915.415
Variance: 0.0191999018
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[47.1559 47.18955 47.19365 47.19585 47.2141 ... 47.70015 47.73735 47.74675 47.75945 47.7776 ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
47.5491 17 0.1%
 
47.6846 17 0.1%
 
47.6624 17 0.1%
 
47.5322 17 0.1%
 
47.6711 16 0.1%
 
47.6886 16 0.1%
 
47.6955 16 0.1%
 
47.686 15 0.1%
 
47.6647 15 0.1%
 
47.6904 15 0.1%
 
Other values (5024) 21452 99.3%
 

Minimum 5 values

Value Count Frequency (%)
47.1559 1 < 0.1%
 
47.1593 1 < 0.1%
 
47.1622 1 < 0.1%
 
47.1647 1 < 0.1%
 
47.1764 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
47.7776 3 < 0.1%
 
47.7775 3 < 0.1%
 
47.7774 1 < 0.1%
 
47.7772 3 < 0.1%
 
47.7771 2 < 0.1%
 

long
Numeric

Distinct count: 752
Unique (%): 3.5%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: -122.2138964
Minimum: -122.519
Maximum: -121.315
Zeros (%): 0.0%

Quantile statistics

Minimum: -122.519
5-th percentile: -122.387
Q1: -122.328
Median: -122.23
Q3: -122.125
95-th percentile: -121.979
Maximum: -121.315
Range: 1.204
Interquartile range: 0.203

Descriptive statistics

Standard deviation: 0.1408283424
Coef of variation: -0.001152310388
Kurtosis: 1.049500887
Mean: -122.2138964
MAD: 0.1151608925
Skewness: 0.8850529834
Sum: -2641408.943
Variance: 0.01983262202
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[-122.519 -122.466 -122.442 -122.4155 -122.4125 ... -121.7685 -121.7435 -121.6945 -121.411 -121.315 ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
-122.29 116 0.5%
 
-122.3 111 0.5%
 
-122.362 104 0.5%
 
-122.291 100 0.5%
 
-122.372 99 0.5%
 
-122.363 99 0.5%
 
-122.288 98 0.5%
 
-122.357 96 0.4%
 
-122.284 95 0.4%
 
-122.365 94 0.4%
 
Other values (742) 20601 95.3%
 

Minimum 5 values

Value Count Frequency (%)
-122.519 1 < 0.1%
 
-122.515 1 < 0.1%
 
-122.514 1 < 0.1%
 
-122.512 1 < 0.1%
 
-122.511 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
-121.315 2 < 0.1%
 
-121.316 1 < 0.1%
 
-121.319 1 < 0.1%
 
-121.321 1 < 0.1%
 
-121.325 1 < 0.1%
 

price
Numeric

Distinct count: 4028
Unique (%): 18.6%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 540088.1418
Minimum: 75000
Maximum: 7700000
Zeros (%): 0.0%

Quantile statistics

Minimum: 75000
5-th percentile: 210000
Q1: 321950
Median: 450000
Q3: 645000
95-th percentile: 1156480
Maximum: 7700000
Range: 7625000
Interquartile range: 323050

Descriptive statistics

Standard deviation: 367127.1965
Coef of variation: 0.6797542255
Kurtosis: 34.58554043
Mean: 540088.1418
MAD: 233941.7243
Skewness: 4.024069145
Sum: 1.167292501e+10
Variance: 1.347823784e+11
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 75000. 109750. 149950. 150275. 159997.5 ... 2002500. 2587500. 3202000. 3825000. 7700000. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
450000 172 0.8%
 
350000 172 0.8%
 
550000 159 0.7%
 
500000 152 0.7%
 
425000 150 0.7%
 
325000 148 0.7%
 
400000 145 0.7%
 
375000 138 0.6%
 
300000 133 0.6%
 
525000 131 0.6%
 
Other values (4018) 20113 93.1%
 

Minimum 5 values

Value Count Frequency (%)
75000 1 < 0.1%
 
78000 1 < 0.1%
 
80000 1 < 0.1%
 
81000 1 < 0.1%
 
82000 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
7700000 1 < 0.1%
 
7062500 1 < 0.1%
 
6885000 1 < 0.1%
 
5570000 1 < 0.1%
 
5350000 1 < 0.1%
 

sqft_above
Numeric

Distinct count: 946
Unique (%): 4.4%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 1788.390691
Minimum: 290
Maximum: 9410
Zeros (%): 0.0%

Quantile statistics

Minimum: 290
5-th percentile: 850
Q1: 1190
Median: 1560
Q3: 2210
95-th percentile: 3400
Maximum: 9410
Range: 9120
Interquartile range: 1020

Descriptive statistics

Standard deviation: 828.0909777
Coef of variation: 0.4630369538
Kurtosis: 3.402303621
Mean: 1788.390691
MAD: 640.3860357
Skewness: 1.446664473
Sum: 38652488
Variance: 685734.6673
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 290. 465. 575. 665. 695. ... 4505. 4865. 5485. 6690. 9410.], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
1300 212 1.0%
 
1010 210 1.0%
 
1200 206 1.0%
 
1220 192 0.9%
 
1140 184 0.9%
 
1400 180 0.8%
 
1060 178 0.8%
 
1180 177 0.8%
 
1340 176 0.8%
 
1250 174 0.8%
 
Other values (936) 19724 91.3%
 

Minimum 5 values

Value Count Frequency (%)
290 1 < 0.1%
 
370 1 < 0.1%
 
380 1 < 0.1%
 
384 1 < 0.1%
 
390 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
9410 1 < 0.1%
 
8860 1 < 0.1%
 
8570 1 < 0.1%
 
8020 1 < 0.1%
 
7880 1 < 0.1%
 

sqft_basement
Numeric

Distinct count: 306
Unique (%): 1.4%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 291.5090455
Minimum: 0
Maximum: 4820
Zeros (%): 60.7%

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 560
95-th percentile: 1190
Maximum: 4820
Range: 4820
Interquartile range: 560

Descriptive statistics

Standard deviation: 442.5750427
Coef of variation: 1.518220616
Kurtosis: 2.715574211
Mean: 291.5090455
MAD: 363.2358668
Skewness: 1.577965056
Sum: 6300385
Variance: 195872.6684
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 0. 5. 45. 95. 135. ... 1605. 1875. 2230. 2830. 4820.], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
0 13126 60.7%
 
600 221 1.0%
 
700 218 1.0%
 
500 214 1.0%
 
800 206 1.0%
 
400 184 0.9%
 
1000 149 0.7%
 
900 144 0.7%
 
300 142 0.7%
 
200 108 0.5%
 
Other values (296) 6901 31.9%
 

Minimum 5 values

Value Count Frequency (%)
0 13126 60.7%
 
10 2 < 0.1%
 
20 1 < 0.1%
 
40 4 < 0.1%
 
50 11 0.1%
 

Maximum 5 values

Value Count Frequency (%)
4820 1 < 0.1%
 
4130 1 < 0.1%
 
3500 1 < 0.1%
 
3480 1 < 0.1%
 
3260 1 < 0.1%
 

sqft_living
Numeric

Distinct count: 1038
Unique (%): 4.8%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 2079.899736
Minimum: 290
Maximum: 13540
Zeros (%): 0.0%

Quantile statistics

Minimum: 290
5-th percentile: 940
Q1: 1427
Median: 1910
Q3: 2550
95-th percentile: 3760
Maximum: 13540
Range: 13250
Interquartile range: 1123

Descriptive statistics

Standard deviation: 918.440897
Coef of variation: 0.4415794093
Kurtosis: 5.24309299
Mean: 2079.899736
MAD: 698.3239196
Skewness: 1.471555427
Sum: 44952873
Variance: 843533.6814
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 290. 510. 665. 695. 804.5 ... 4755. 5560. 6077.5 8015. 13540. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
1300 138 0.6%
 
1400 135 0.6%
 
1440 133 0.6%
 
1010 129 0.6%
 
1660 129 0.6%
 
1800 129 0.6%
 
1820 128 0.6%
 
1480 125 0.6%
 
1720 125 0.6%
 
1540 124 0.6%
 
Other values (1028) 20318 94.0%
 

Minimum 5 values

Value Count Frequency (%)
290 1 < 0.1%
 
370 1 < 0.1%
 
380 1 < 0.1%
 
384 1 < 0.1%
 
390 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
13540 1 < 0.1%
 
12050 1 < 0.1%
 
10040 1 < 0.1%
 
9890 1 < 0.1%
 
9640 1 < 0.1%
 

sqft_living15
Numeric

Distinct count: 777
Unique (%): 3.6%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 1986.552492
Minimum: 399
Maximum: 6210
Zeros (%): 0.0%

Quantile statistics

Minimum: 399
5-th percentile: 1140
Q1: 1490
Median: 1840
Q3: 2360
95-th percentile: 3300
Maximum: 6210
Range: 5811
Interquartile range: 870

Descriptive statistics

Standard deviation: 685.3913043
Coef of variation: 0.3450154512
Kurtosis: 1.59709581
Mean: 1986.552492
MAD: 536.2192073
Skewness: 1.108181276
Sum: 42935359
Variance: 469761.2399
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 399. 680. 829. 975. 994. ... 3755. 3995. 4325. 4945. 6210.], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
1540 197 0.9%
 
1440 195 0.9%
 
1560 192 0.9%
 
1500 181 0.8%
 
1460 169 0.8%
 
1580 167 0.8%
 
1610 166 0.8%
 
1800 166 0.8%
 
1720 166 0.8%
 
1620 165 0.8%
 
Other values (767) 19849 91.8%
 

Minimum 5 values

Value Count Frequency (%)
399 1 < 0.1%
 
460 2 < 0.1%
 
620 2 < 0.1%
 
670 1 < 0.1%
 
690 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
6210 1 < 0.1%
 
6110 1 < 0.1%
 
5790 6 < 0.1%
 
5610 1 < 0.1%
 
5600 1 < 0.1%
 

sqft_lot
Numeric

Distinct count: 9782
Unique (%): 45.3%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 15106.96757
Minimum: 520
Maximum: 1651359
Zeros (%): 0.0%

Quantile statistics

Minimum: 520
5-th percentile: 1800
Q1: 5040
Median: 7618
Q3: 10688
95-th percentile: 43339.2
Maximum: 1651359
Range: 1650839
Interquartile range: 5648

Descriptive statistics

Standard deviation: 41420.51152
Coef of variation: 2.741815082
Kurtosis: 285.0778197
Mean: 15106.96757
MAD: 13837.26422
Skewness: 13.06001896
Sum: 326506890
Variance: 1715658774
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[5.200000e+02 6.755000e+02 8.635000e+02 1.154500e+03 1.351500e+03 ... 2.178025e+05 2.246055e+05 2.942475e+05 5.061020e+05 1.651359e+06], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
5000 358 1.7%
 
6000 290 1.3%
 
4000 251 1.2%
 
7200 220 1.0%
 
4800 120 0.6%
 
7500 119 0.6%
 
4500 114 0.5%
 
8400 111 0.5%
 
9600 109 0.5%
 
3600 103 0.5%
 
Other values (9772) 19818 91.7%
 

Minimum 5 values

Value Count Frequency (%)
520 1 < 0.1%
 
572 1 < 0.1%
 
600 1 < 0.1%
 
609 1 < 0.1%
 
635 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
1651359 1 < 0.1%
 
1164794 1 < 0.1%
 
1074218 1 < 0.1%
 
1024068 1 < 0.1%
 
982998 1 < 0.1%
 

sqft_lot15
Numeric

Distinct count: 8689
Unique (%): 40.2%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 12768.45565
Minimum: 651
Maximum: 871200
Zeros (%): 0.0%

Quantile statistics

Minimum: 651
5-th percentile: 1999.2
Q1: 5100
Median: 7620
Q3: 10083
95-th percentile: 37062.8
Maximum: 871200
Range: 870549
Interquartile range: 4983

Descriptive statistics

Standard deviation: 27304.17963
Coef of variation: 2.138408933
Kurtosis: 150.76311
Mean: 12768.45565
MAD: 10118.66071
Skewness: 9.506743247
Sum: 275964632
Variance: 745518225.3
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[6.510000e+02 9.145000e+02 1.056500e+03 1.168000e+03 1.279500e+03 ... 2.177945e+05 2.180110e+05 2.245555e+05 4.364705e+05 8.712000e+05], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
5000 427 2.0%
 
4000 357 1.7%
 
6000 289 1.3%
 
7200 211 1.0%
 
4800 145 0.7%
 
7500 142 0.7%
 
8400 116 0.5%
 
3600 111 0.5%
 
4500 111 0.5%
 
5100 109 0.5%
 
Other values (8679) 19595 90.7%
 

Minimum 5 values

Value Count Frequency (%)
651 1 < 0.1%
 
659 1 < 0.1%
 
660 1 < 0.1%
 
748 2 < 0.1%
 
750 4 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
871200 1 < 0.1%
 
858132 1 < 0.1%
 
560617 1 < 0.1%
 
438213 1 < 0.1%
 
434728 1 < 0.1%
 

view
Numeric

Distinct count: 5
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.2343034285
Minimum: 0
Maximum: 4
Zeros (%): 90.2%

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 2
Maximum: 4
Range: 4
Interquartile range: 0

Descriptive statistics

Standard deviation: 0.7663175693
Coef of variation: 3.270620384
Kurtosis: 10.89302168
Mean: 0.2343034285
MAD: 0.4225548992
Skewness: 3.395749593
Sum: 5064
Variance: 0.587242617
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=5)

Value Count Frequency (%)
0 19489 90.2%
 
2 963 4.5%
 
3 510 2.4%
 
1 332 1.5%
 
4 319 1.5%
 

Minimum 5 values

Value Count Frequency (%)
0 19489 90.2%
 
1 332 1.5%
 
2 963 4.5%
 
3 510 2.4%
 
4 319 1.5%
 

Maximum 5 values

Value Count Frequency (%)
4 319 1.5%
 
3 510 2.4%
 
2 963 4.5%
 
1 332 1.5%
 
0 19489 90.2%
 

waterfront
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing (%): 0.0%
Missing (n): 0

Value Count Frequency (%)
0 21450 99.2%
 
1 163 0.8%
 

yr_built
Numeric

Distinct count: 116
Unique (%): 0.5%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 1971.005136
Minimum: 1900
Maximum: 2015
Zeros (%): 0.0%

Quantile statistics

Minimum: 1900
5-th percentile: 1915
Q1: 1951
Median: 1975
Q3: 1997
95-th percentile: 2011
Maximum: 2015
Range: 115
Interquartile range: 46

Descriptive statistics

Standard deviation: 29.3734108
Coef of variation: 0.01490275711
Kurtosis: -0.6574075047
Mean: 1971.005136
MAD: 24.56566156
Skewness: -0.4698053988
Sum: 42599334
Variance: 862.7972622
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[1900. 1900.5 1904.5 1909.5 1910.5 ... 2009.5 2011.5 2013.5 2014.5 2015. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
2014 559 2.6%
 
2006 454 2.1%
 
2005 450 2.1%
 
2004 433 2.0%
 
2003 422 2.0%
 
2007 417 1.9%
 
1977 417 1.9%
 
1978 387 1.8%
 
1968 381 1.8%
 
2008 367 1.7%
 
Other values (106) 17326 80.2%
 

Minimum 5 values

Value Count Frequency (%)
1900 87 0.4%
 
1901 29 0.1%
 
1902 27 0.1%
 
1903 46 0.2%
 
1904 45 0.2%
 

Maximum 5 values

Value Count Frequency (%)
2015 38 0.2%
 
2014 559 2.6%
 
2013 201 0.9%
 
2012 170 0.8%
 
2011 130 0.6%
 

yr_renovated
Numeric

Distinct count: 70
Unique (%): 0.3%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 84.4022579
Minimum: 0
Maximum: 2015
Zeros (%): 95.8%

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 2015
Range: 2015
Interquartile range: 0

Descriptive statistics

Standard deviation: 401.67924
Coef of variation: 4.759105384
Kurtosis: 18.70115212
Mean: 84.4022579
MAD: 161.6658804
Skewness: 4.549493367
Sum: 1824186
Variance: 161346.2119
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 0. 967. 1937. 1954.5 1976.5 ... 2007.5 2012.5 2013.5 2014.5 2015. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
0 20699 95.8%
 
2014 91 0.4%
 
2013 37 0.2%
 
2003 36 0.2%
 
2000 35 0.2%
 
2007 35 0.2%
 
2005 35 0.2%
 
2004 26 0.1%
 
1990 25 0.1%
 
2006 24 0.1%
 
Other values (60) 570 2.6%
 

Minimum 5 values

Value Count Frequency (%)
0 20699 95.8%
 
1934 1 < 0.1%
 
1940 2 < 0.1%
 
1944 1 < 0.1%
 
1945 3 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
2015 16 0.1%
 
2014 91 0.4%
 
2013 37 0.2%
 
2012 11 0.1%
 
2011 13 0.1%
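
With 95.8% zeros and every other value between 1934 and 2015, the mean of 84.4 for yr_renovated mixes two kinds of values. The sketch below assumes, without confirmation from the report, that 0 is a placeholder for "no renovation recorded". It uses only the reported sum and counts to show how the mean changes once zeros are set aside.

```python
# Figures copied from the yr_renovated section.
n_observations = 21613
n_zeros = 20699
reported_sum = 1824186

# Assumption: a zero means "no renovation recorded", not the year 0.
n_nonzero = n_observations - n_zeros

mean_all = reported_sum / n_observations  # mean over all rows, as the report shows it
mean_nonzero = reported_sum / n_nonzero   # zeros add nothing to the sum

print(n_nonzero)
print(round(mean_all, 7))
print(round(mean_nonzero, 2))
```

If the assumption holds, a typical non-zero value sits in the mid-1990s, and the zeros would be better handled as missing values than as numbers.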
 

zipcode
Numeric

Distinct count: 70
Unique (%): 0.3%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 98077.9398
Minimum: 98001
Maximum: 98199
Zeros (%): 0.0%

Quantile statistics

Minimum: 98001
5-th percentile: 98004
Q1: 98033
Median: 98065
Q3: 98118
95-th percentile: 98177
Maximum: 98199
Range: 198
Interquartile range: 85

Descriptive statistics

Standard deviation: 53.50502626
Coef of variation: 0.0005455357888
Kurtosis: -0.8534788732
Mean: 98077.9398
MAD: 46.72127898
Skewness: 0.4056612082
Sum: 2119758513
Variance: 2862.787835
Memory size: 169.0 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[98001. 98001.5 98002.5 98004.5 98005.5 ... 98151.5 98183. 98193. 98198.5 98199. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
98103 602 2.8%
 
98038 590 2.7%
 
98115 583 2.7%
 
98052 574 2.7%
 
98117 553 2.6%
 
98042 548 2.5%
 
98034 545 2.5%
 
98118 508 2.4%
 
98023 499 2.3%
 
98006 498 2.3%
 
Other values (60) 16113 74.6%
 

Minimum 5 values

Value Count Frequency (%)
98001 362 1.7%
 
98002 199 0.9%
 
98003 280 1.3%
 
98004 317 1.5%
 
98005 168 0.8%
 

Maximum 5 values

Value Count Frequency (%)
98199 317 1.5%
 
98198 280 1.3%
 
98188 136 0.6%
 
98178 262 1.2%
 
98177 255 1.2%
 

Correlations

Missing values

Sample

First rows

(index) bathrooms bedrooms condition date floors grade id lat long price sqft_above sqft_basement sqft_living sqft_living15 sqft_lot sqft_lot15 view waterfront yr_built yr_renovated zipcode
0 1.00 3 3 20141013T000000 1.0 7 7129300520 47.5112 -122.257 221900.0 1180 0 1180 1340 5650 5650 0 0 1955 0 98178
1 2.25 3 3 20141209T000000 2.0 7 6414100192 47.7210 -122.319 538000.0 2170 400 2570 1690 7242 7639 0 0 1951 1991 98125
2 1.00 2 3 20150225T000000 1.0 6 5631500400 47.7379 -122.233 180000.0 770 0 770 2720 10000 8062 0 0 1933 0 98028
3 3.00 4 5 20141209T000000 1.0 7 2487200875 47.5208 -122.393 604000.0 1050 910 1960 1360 5000 5000 0 0 1965 0 98136
4 2.00 3 3 20150218T000000 1.0 8 1954400510 47.6168 -122.045 510000.0 1680 0 1680 1800 8080 7503 0 0 1987 0 98074
5 4.50 4 3 20140512T000000 1.0 11 7237550310 47.6561 -122.005 1225000.0 3890 1530 5420 4760 101930 101930 0 0 2001 0 98053
6 2.25 3 3 20140627T000000 2.0 7 1321400060 47.3097 -122.327 257500.0 1715 0 1715 2238 6819 6819 0 0 1995 0 98003
7 1.50 3 3 20150115T000000 1.0 7 2008000270 47.4095 -122.315 291850.0 1060 0 1060 1650 9711 9711 0 0 1963 0 98198
8 1.00 3 3 20150415T000000 1.0 7 2414600126 47.5123 -122.337 229500.0 1050 730 1780 1780 7470 8113 0 0 1960 0 98146
9 2.50 3 3 20150312T000000 2.0 7 3793500160 47.3684 -122.031 323000.0 1890 0 1890 2390 6560 7570 0 0 2003 0 98038

Last rows

(index) bathrooms bedrooms condition date floors grade id lat long price sqft_above sqft_basement sqft_living sqft_living15 sqft_lot sqft_lot15 view waterfront yr_built yr_renovated zipcode
21603 2.50 3 3 20140825T000000 2.0 8 7852140040 47.5389 -121.881 507250.0 2270 0 2270 2270 5536 5731 0 0 2003 0 98065
21604 2.00 3 3 20150126T000000 3.0 8 9834201367 47.5699 -122.288 429000.0 1490 0 1490 1400 1126 1230 0 0 2014 0 98144
21605 2.50 4 3 20141014T000000 2.0 9 3448900210 47.5137 -122.167 610685.0 2520 0 2520 2520 6023 6023 0 0 2014 0 98056
21606 3.50 4 3 20150326T000000 2.0 9 7936000429 47.5537 -122.398 1007500.0 2600 910 3510 2050 7200 6200 0 0 2009 0 98136
21607 2.50 3 3 20150219T000000 2.0 8 2997800021 47.5773 -122.409 475000.0 1180 130 1310 1330 1294 1265 0 0 2008 0 98116
21608 2.50 3 3 20140521T000000 3.0 8 263000018 47.6993 -122.346 360000.0 1530 0 1530 1530 1131 1509 0 0 2009 0 98103
21609 2.50 4 3 20150223T000000 2.0 8 6600060120 47.5107 -122.362 400000.0 2310 0 2310 1830 5813 7200 0 0 2014 0 98146
21610 0.75 2 3 20140623T000000 2.0 7 1523300141 47.5944 -122.299 402101.0 1020 0 1020 1020 1350 2007 0 0 2009 0 98144
21611 2.50 3 3 20150116T000000 2.0 8 291310100 47.5345 -122.069 400000.0 1600 0 1600 1410 2388 1287 0 0 2004 0 98027
21612 0.75 2 3 20141015T000000 2.0 7 1523300157 47.5941 -122.299 325000.0 1020 0 1020 1020 1076 1357 0 0 2008 0 98144
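
The sample rows suggest that sqft_living equals sqft_above plus sqft_basement. The sketch below checks that relation on the first three sample rows and on the column sums reported in the three variable sections. Matching sums are consistent with the relation but do not prove it holds on every row, so this is a sanity check rather than a guarantee.

```python
# (sqft_above, sqft_basement, sqft_living) from sample rows 0, 1 and 2.
sample_rows = [
    (1180, 0, 1180),
    (2170, 400, 2570),
    (770, 0, 770),
]

# Row-wise check on the sample.
rows_consistent = all(
    above + basement == living for above, basement, living in sample_rows
)

# Column sums copied from the descriptive statistics of each variable.
sum_above = 38652488
sum_basement = 6300385
sum_living = 44952873

# Aggregate check on the whole dataset.
sums_consistent = sum_above + sum_basement == sum_living

print(rows_consistent, sums_consistent)
```

If the relation does hold throughout, one of the three columns is redundant, which matters for models that are sensitive to collinear inputs.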